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Comment on "Tests of scaling and universality..." 
by Plerou and Stanley, Phys. Rev. E 76,046109 

In a pioneering study Plerou et al. [1] extended the 
statistical analysis of financial data on the distribution of 
the volume q, the number of shares exchanged in a single 
transaction. They concluded that this decays asymptot- 
ically as a power law, P(q > x) oc x~^ q with Q q ps 1.5, 
a value in the Levy stable regime. A later study in Ref. 
[2], however, found significantly higher values for data 
aggregated over 15 minutes, typically £q ps 2.3, outside 
the Levy regime. 

In a recent Article [3], Plerou and Stanley extended 
their analysis and found ( q ~ 1.49 — 1.65 for three differ- 
ent stock markets. Ref. [3] argued that the discrepancy 
between the previous works [1,2] in determining the tail 
exponents could be traced back to the use of the Hill's 
estimator (HE) [4], which contains a cutoff parameter 
that is usually determined in a qualitative fashion. In 
contrast, Ref. [3] uses the Meerschaert-Scheffler estima- 
tor (MSE) [5], which is cutoff and shift independent and 
defined as 
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qi are independent observations of q, and (q) is its mean, 
7 = 0.5772 is Euler's constant and ln + (a;) = max(0, In a;). 

The use of the MSE raises several problems in this 
specific case. First of all, the MSE is an estimator of the 
tail-index only if the index is less than 2, otherwise (i.e., if 
the variance is finite) it converges to 2.0. In addition, the 
rate of convergence is quite slow compared to HE. These 
two factors result in the MSE being an questionable tool 
to distinguish between the cases C > 2 and £ < 2. Fur- 
thermore, this estimator is not invariant to changes of 
the scale: for any fixed A > 1, Q% > (™ s . In [3] the 
data are rescaled with the mean absolute deviation. 

In order to demonstrate that the MSE based proce- 
dure yields misleading results, we used it on computer- 
generated Pareto distributed data with different tail ex- 
ponents. Sample sizes were chosen to be comparable with 
that of financial data considering liquid stocks on NYSE 
in the period studied in Ref. [3] (1994-1995), where 
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1.65 ± 0.01 was concluded. Our results are shown 


in Fig. 1. 

The HE has no limitation for the value of the tail expo- 
nent to be determined, however, as mentioned in [3], the 
estimation depends on the cutoff, at least in the original 
version [4] . In a recent improvement [6] this problem was 
resolved by using a self-consistent optimization scheme. 
We have found that the problem of shift-dependence [2] 
could also be settled in a similar fashion. Using the im- 
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Figure 1: (color online) MSE estimates of the tail exponent of 
generated Pareto distributed data. Each point corresponds to 
the average of 100 runs. The sample sizes are indicated in the 
legend. The solid black line shows the asymptotic (N — > 00) 
behavior of the MSE. The empirical value of £ 9 — 2.02 with 
the errors as calculated from the self consistent Hill estimator 
and the corresponding MSE values are indicated (green and 
red lines). 


proved Hill schemes we determined the tail exponents for 
the volume distribution of transactions for the 1000 most 
liquid stocks on the NYSE for the same period as studied 
in [3]. The obtained results are significantly higher than 
those of [3]. We find an average exponent ( q = 2.02±0.45. 
The corresponding values are shown in Fig. 1 and it 
is indicated how such measured values result in smaller 
MSE estimates. For the aggregated data with a time 
window of 15 min we find Cq ~ 2.3, in agreement with 
[2]- 
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